IRIS: A Protégé Plug-in to Extract and Serialize Product Attribute Name-Value Pairs
نویسنده
چکیده
This article introduces IRIS wrapper, which is developed as a Protégé plug-in, to solve an increasingly important problem: extracting information from the product descriptions provided by online sources and structuring this information so that is sharable among business entities, software agents and search engines. Extracted product information is presented in a GoodRelations-compliant ontology. IRIS also automatically marks up your products using RDFa or Microdata. Creating GoodRelations snippets in RDFa or Microdata using the product information extracted from Web is a business value, especially when you consider most of the popular search engines recommend the use of these standards to provide rich site data for their index.
منابع مشابه
Knowtator: A Protégé plug-in for annotated corpus construction
A general-purpose text annotation tool called Knowtator is introduced. Knowtator facilitates the manual creation of annotated corpora that can be used for evaluating or training a variety of natural language processing systems. Building on the strengths of the widely used Protégé knowledge representation system, Knowtator has been developed as a Protégé plug-in that leverages Protégé’s knowledg...
متن کاملSemi-Supervised Learning to Extract Attribute-Value Pairs from Product Descriptions on the Web
We describe an approach to extract attribute-value pairs from product descriptions on the Web. The goal is to augment product databases by representing each product as a set of such attribute-value pairs. Such a representation is useful for a variety of tasks where treating the product as a set of attribute-value pairs is more useful than as an atomic entity. Examples include product recommenda...
متن کاملExtracting and Using Attribute-Value Pairs from Product Descriptions on the Web
We describe an approach to extract attribute-value pairs from product descriptions in order to augment product databases by representing each product as a set of attribute-value pairs. Such a representation is useful for a variety of tasks where treating a product as a set of attribute-value pairs is more useful than as an atomic entity. We formulate the extraction task as a classification prob...
متن کاملSemi-Supervised Learning of Attribute-Value Pairs from Product Descriptions
We describe an approach to extract attribute-value pairs from product descriptions. This allows us to represent products as sets of such attribute-value pairs to augment product databases. Such a representation is useful for a variety of tasks where treating a product as a set of attribute-value pairs is more useful than as an atomic entity. Examples of such applications include product recomme...
متن کاملPlug - in for Protégé 2000 which supports Sesame
This paper presents a solution for a new plug-in named Passerelle for Protégé 2000. Passerelle makes it possible to connect Protégé to Sesame architecture, to store and query RDF(s) data. With Passerelle, Protégé becomes a stronger ontology editor. It gives an ontology developer, the possibility of using RDF query Language (RQL) as it enables much stronger and advanced query possibilities than ...
متن کامل